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Abstract 

We study the complexity of valued constraint satisfaction problems (VCSP). A problem from VCSP 
is characterised by a constraint language, a fixed set of cost functions over a finite domain. An instance 
of the problem is specified by a sum of cost functions from the language and the goal is to minimise the 
sum. We consider the case of so-called conservative languages; that is, languages containing all unary 
cost functions, thus allowing arbitrary restrictions on the domains of the variables. This problem has 
been studied by Bulatov [LICS'03] for {0, oo}-valued languages (i.e. CSP), by Cohen ef aZ. (AIJ'06)for 
Boolean domains, by Deineko et al. (JACM'08) for {0, l}-valued cost functions (i.e. Max-CSP), and 
by Takhanov (STAGS' 10) for {0, cxaj-valued languages containing all finite-valued unary cost functions 
(i.e. Min-Cost-Hom). 

We give an elementary proof of a complete complexity classification of conservative finite-valued 
languages: we show that every conservative finite-valued language is either tractable or NP-hard. This 
is the first dichotomy result for finite-valued VCSPs over non-Boolean domains. 
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1 Introduction 



The constraint satisfaction problem is a central generic problem in computer science. It provides a common 
framework for many theoretical problems as well as for many real-life applications, see [ 1 ] for a nice survey. 
An instance of the constraint satisfaction problem (CSP) consists of a collection of variables which must be 
assigned values subject to specified constraints. CSP is known to be equivalent the problem of evaluating 
conjunctive queries on databases [i], and to the homomorphism problem for relations structures [i]. 

An important line of research on the CSP is to identify all tractable cases; that is, cases that are recog- 
nisable and solvable in polynomial time. Most of this work has been focused on one of the two general 
approaches: either identifying structural properties of the way constrains interact which ensure tractabil- 
ity no matter what forms of constraint are imposed [-+], or else identifying forms of constraint which aie 
sufficiently restrictive to ensure tractability no matter how they are combined [5,3]. 

The first approach has been used to characterise all tractable cases of bounded-arity CSPs: the only class 
of structures which ensures tractability (subject to a certain complexity theory assumption, namely FPT ^ 
W[l]) are structures of bounded tree-width modulo homomorphic equivalence [u, /]. The second approach 
has led to identifying certain algebraic properties known as polymorphisms [8] which are necessary for a 
set of constraint types to ensure tractability. A set of constraint types which ensures tractability is called a 
tractable constraint language. 

Schaefer in his seminal work [ ] gave a complete complexity classification of Boolean constraint lan- 
guages. The algebraic approach based on polymorphisms [10] has been so far the most successful tool in 
generalising Schaefer's result to languages over a 3-element domain [1 1], conservative languages [12], lan- 
guages comprising of a single binary relation without sources and sinks [13] (see also [14]), and languages 
comprising of a single binary relation that is a special triad [15]. The algebraic approach has also been 
essential in characterising the power of local consistency [16] and the "few subpowers property" [17], the 
two main tools known for solving tractable CSPs. The ultimate goal in this Une of research is to answer the 
Dichotomy Conjecture of Feder and Vardi, which states that every constraint language is either tractable or 
NP-hard [3]. We remark that there are other approaches to the dichotomy conjecture; see, for instance, [1] 
for a nice survey of Hell and Nesetfil, and [18] for a connection between the Dichotomy Conjecture and 
probabilistically checkable proofs. 

Since in practice many constraint satisfaction problems are over-constrained, and hence have no solu- 
tion, or are under-constrained, and hence have many solutions, soft constraint satisfaction problems have 
been studied [' ]. In an instance of the soft CSP, every constraint is associated with a function (rather 
than a relation as in the CSP) which represents preferences among different partial assignments, and the 
goal is to find the best assignment. Several very general soft CSP frameworks have been proposed in the 
literature [20, 21]. In this paper we focus on one of the very general frameworks, the valued constraint 
satisfaction problem (VCSP) [20]. 

Similarly to the CSP, an important line of research on the VCSP is to identify tractable cases which are 
recognisable in polynomial time. Is is well known that structural reasons for tractability generalise to the 
VCSP [19]. In the case of language restrictions, only a few conditions are known to guarantee tractability 
of a given set of valued constraints [22, 23]. 

Related work Creignou et al. have studied various generalisations of the CSP to optimisation problems 
over Boolean domains [ ' ], see also [25, 26]. Cohen et al. have completely classified the complexity of 
valued languages over Boolean domains [ ]. 

Bulatov, making use of the techniques of universal algebra, has completely classified the complexity 
of conservative {0, oo}-valued languages, i.e. languages containing all possible {0, oo}-valued unary cost 
functions. (Note that {0, oo}-valued languages are just normal constraint languages, that is, sets of rela- 
tions.) Using the algebraic approach, Takhanov has completely classified the complexity of {0, oo}-valued 
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languages containing additionally all finite-valued unary cost functions [27]. (Note that this problem is 
equivalent to the minimum-cost homomorphism problem.) It is not clear how to use the algebraic methods 
of Bulatov and Takhanov for our problem as the algebraic methods developed so far work only for {0, oo}- 
valued VCSPs, that is, CSPs. (One possible way might be to use results of Cohen et al. [28], but our results 
and techniques do not rely on results in [2S].) 

Deineko et al. [29] have completely classified the complexity of conservative {0, l}-valued cost func- 
tion languages. ' (Note that with respect to polynomial-time solvability {0, l}-valued VCSPs and Max-CSP 
are equivalent.) The crucial result in [ ] is to show that if a conservative {0, l}-valued language F is not 
submodular with respect to any order, then F can express (without changing the complexity) language L' 
that is not submodular with respect to any order and L' consists of at most 3 binary cost functions over a 
domain of size at most 4. In the {0, l}-valued case, there are only finitely many of these cost functions, and 
a cleverly -pruned computer search had shown that all such languages are intractable [29]. It is not clear 
how to use the techniques from [29] for the general VCSP framework with arbitrary costs. 

Raghavendra [^0] and Raghavendra and Steurer [31] have shown how to optimally approximate any 
finite-valued VCSP 

Contributions We prove a dichotomy theorem for finite-valued conservative languages: we show that 
a conservative finite-valued language F is tractable if and only if F admits a symmetric tournament pair 
multimorphism [23]. This result is known for Boolean conservative finite-valued languages [ ]. The 
Boolean case has been resolved using the notion of multimorphisms [2_]. We will also use multimorphisms 
here to derive the first complete complexity classification over domains of arbitrary size. To compare our 
result with [29], we show that even in the more general framework of finite-valued languages (as opposed 
to {0, l}-valued languages), there is only one reason for tractability, which is the same as in [ ].- 

Despite the fact that we use the notion of multimorphisms, our proofs are elementary and largely self- 
contained. We do not rely on the heavy algebraic machinery that has been used in the complexity classi- 
fication of conservative {0, cxoj-valued languages by Bulatov [12], and in the complexity classification of 
{0, oo}-valued languages containing additionally all finite-valued unary cost functions by Takhanov [27]. 

Naturally, our approach is similar to the one of Bulatov and Takhanov: the complexity of Boolean 
conservative languages is known, and hence given a non-Boolean language, we explore the interactions 
between different 2-element subdomains. Given a conservative finite- valued language F, we will investigate 
properties of a certain graph Gr associated with the language and cost functions expressible over F. The 
graph is finite, but not easy to construct. We show that F is tractable if and only if Gr satisfies certain 
properties. Thus we obtain a dichotomy theorem. However, in order to test whether a given F is tractable, 
we do not need to construct Gy as it follows from our result that we just need to test for the existence of a 
certain multimorphism of F. 

Even though our results are formulated as valued constraint satisfaction problems, they apply to various 
other optimisation frameworks that are equivalent to valued constraint problems such as Gibbs energy 
minimisation, Markov Random Fields and other graphical models [32, 33]. 

Organisation of the paper The rest of the paper is organised as follows. In Section 2, we define valued 
constraint satisfaction problems (VCSPs), conservative languages, multimorphisms and other necessary 
definitions needed throughout the paper. We state our results in Section 3 and prove them in Section 4. 

' In fact, results in [29] are stronger in two ways. Firstly, Deineko et al. have classified the complexity of languages containing 
the so-called fixed-value constraints; that is, constraints of the form "x=d", where a:: is a variable and a is a domain value. Secondly, 
Deineko et al. have considered the weighted version of the problem, which allows the range of every cost function / to be {0, c/}, 
where c/ depends on /. (The {0, l}-valued case is the special case when c/ = 1 for every cost function /.) 

^It follows from [\"^] that in the finite-valued case, having a symmetric tournament pair multimorphism is the same as being 
submodular with respect to some order. 
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Finally, in Section 5 we discuss the general case, which is still open. We review some known results and 
state our conjecture for the general case. 



2 Background and notation 

We denote by R+ the set of all non-negative real numbers. We denote = ]R_|_ U {cxo} with the standard 
addition operation extended so that for all a G M+, a + oo = oo. Members of are called costs. 

Throughout the paper, we denote by D any fixed finite set, called a domain. Elements of D are called 
domain values or labels. For distinct labels a, 6, ... in Z) we denote Da = {a}, Dab = {o, b} and so on. 

A function / from D™ to M+ will be called a cost function on D of arity m. If the range of / lies 
entirely within M, then / is called a finite-valued cost function. If the range of / is {0, oo}, then / is 
called a crisp cost function. If the range of a cost function / includes non-zero finite costs and infinity, 
we emphasise this fact by calling / a general-valued cost function. Let / : D"'' — >■ M+ be an m-ary cost 
function /. We denote dom/ = {a; G | f{x) < oo} to be the effective domain of /. Functions / of 
arity m = 2 are called binary. 

A language is a set of cost functions with the same set D. Language T is called finite-valued (crisp, 
general-valued respectively) if all cost functions in T are finite-valued (crisp, general-valued respectively). 
A language F is Boolean if\D\ =2. 

Definition 1. An instance X of the valued constraint satisfaction problem (VCSP) is a function D^ —> R_|_ 
given by 



It is specified by a finite set of nodes V, finite set of terms T, cost functions ft : D™' — ?• or arity nit ond 
indices i(t, k) € V for t £ T, k = 1, . . . , mt. A solution to X is an assignment x G D^ with the minimum 
cost. 

We denote by VCSP(r) the class of all VCSP instances whose terms ft belong to T. A finite language 
r is called tractable if VCSP(r) can be solved in polynomial time, and intractable if VCSP(r) is NP- 
hard. An infinite language V is tractable if every finite subset F' C F is tractable, and intractable if there is 
a finite subset F' C F that is intractable. 

Intuitively, a language is conservative if one can restrict the domain of any variable to an arbitrary 
subset of the domain. 

Definition 2. Language F is called conservative ifV contains all finite-valued unary cost functions u : D ^ M+. 

We remark that in the crisp case, conservative languages are defined differently in the literature: a crisp 
language is called conservative if it contains all possible {0, oo}-valued unary cost functions In this 
paper we always use Definition 2, unless explicitly referring to the crisp case. 

Definition 3. Let (n, U) be a pair of operations, where n,\J : D x D D. 

• Pair (n, U) is called a multimoiphism /or yM?ic?/o?i / : D"^ if 




f{xny) + f{xUy)<f{x) + f{y) 



Vx, y G domf 



(1) 



where operations U are applied component-wise. 



• Pair (n, U) is called conservative if {ar\b,a\Jb} = {a, h} for all a,b £ D. 
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• Pair (n,U) is called a symmetric tournament pair (STP) if it is conservative and both operations 
n, U are commutative, i.e. a □ 6 = 6 □ a and a U 6 = 6 U a for all a,b € D. 

We say that (fl, U) is a multimorphism of language F, or F admits U), if all cost functions / € F 
satisfy (1). Using a polynomial-time algorithm for minimising submodulai" functions, Cohen et al. have 
obtained the following result: 

Theorem 4 ([23]). If a language F admits an STP, then F is tractable. 

Cohen et al. have completely classified the tractability of Boolean languages, i.e. the case \D\ = 2. The 
following is a simple corollary of their classification: 

Theorem 5 ([^^]). Let T be a Boolean conservative finite-valued language. IfT admits an STP then it is 
tractable; otherwise it is NP-hard. 

The main result of this paper is a generalisation of this result to non-Boolean domains. 

Finally, we define the important notion of expressibility, which captures the idea of introducing aux- 
iliary variables in a VCSP instance and the possibility of minimising over these auxiliary variables. (For 
crisp languages, this is equivalent to implementation [25].) 

Definition 6. A cost function f : — )■ is expressible over a language F if there exists an instance 
X € VCSP{T) with the set of nodes V = {1, . . . ,m,m + 1, . . . ,m + k} where k > such that 

f{x) = mill Costx{x,y) \fx € D"^ 

We define F* to be the expressive power o/F; that is, the set of all cost functions f such that f is expressible 
over F. 

The importance of expressibility is in the following result: 

Theorem 7 ([22]). For any language F, F is tractable iffV* is tractable. 

It is easy to observe and well known that if U) is a multimorphism of F, then it is also a multimor- 
phism of F* [22]. 

From now on we assume that F is a fixed conservative language. 

3 Our results 

In this section, we relate the complexity of a conservative language F to properties of a certain graph Gy 
associated with F. 

Consider undirected graph Gr = {P, E) where the set of nodes is P = {(a, b) \ a,b G D, a ^ b} and 
the set of edges E is defined as follows: there is an edge between (a, b) G P and (a', b') G P iff there exists 
binary cost function / G F* such that 

fia,a') + fib,b')>fia,b') + fib,a'), (a, 6'), (&, a') G dom/ (2) 

Note that Gr may have self-loops. For node p G P we denote the self-loop by {p,p}. We say that 
edge {(a, 6), (a', b')} G E is soft if there exists binary / G F* satisfying (2) such that at least one of the 
assignments (a, a'), (6, b') is in dom/. Edges in E that are not soft are called hard. For node p = {a,b) ^ P 
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we denote p = (6, o) € P. Note, a somewhat similai^ graph (but not the same) was used by Takhanov [27] 
for languages F containing crisp functions and finite unary terms. 

We denote M C P to be the set of vertices (a, 6) € P without self-loops, and M = P — M to be the 
complement of M. It is not difficult to show that (o, b) € M iff (6, a) G M (see next section). 

Deflnition 8. A conservative pair of operations (n,U) : D x D ^ D is called an STP on M if it is 
commutative on M, i.e. a □ 6 = 6 □ a and a U 6 = 6 U a for all (a, b) G M. 

Our main result is the following 

Theorem 9. 

(a) If E has a soft self-loop {(o, b), (a, 6)}, then T is NP-hard. 

(b) If E does not have soft self-loops, then T admits an STP on M. 

A proof is given in the next section. When combined with the result of Cohen et al. [23], Theorem 9 
gives a dichotomy result for conservative finite-valued languages: 

Corollary 10. Let T be a conservative finite-valued language. If T admits an STP, then T is tractable; 
otherwise T is NP-hard. 

Proof. Consider the graph Gy associated with F. If Gy contains a soft self-loop, then, by Theorem 9(a), F 
is intractable. Suppose that Gr does not contain soft self-loops. As F is finite-valued, Gy cannot have hard 
self-loops. Therefore, M is empty and A/ = P. By Theorem 9(b), F admits an STP. The tractability then 
follows from Theorem 4. □ 

Remark 1 Note that we can determine whether F is tractable by testing the existence of an STP of F. For 
a fixed finite D and F, there are only finitely many potential STPs. We do not need to build the graph Gr, 
whose properties depend on F*. 

Remark 2 Our results can be easily extended to cost functions with the range M; that is, allowing finite 
negative costs. This is due to the fact that for any finite language F, we can add a finite cost to all cost 
functions from F to make the costs non-negative and this does not change the complexity of F. 
Remark 3 Let Ud{x) = if x = d and tirf(x) = c if x ^ d, where c € M+ is a fixed non-zero 
cost. Cost functions Ud, where d £ D, are called fixed-value cost functions. It is easy to see that our 
results classify the complexity of fixed-value languages. Indeed, as fixed-value languages are a subset of 
conservative languages, the tractability result remains the same, and the finite- value case of Theorem 9(a) 
remains valid as well. However, this is not the case for crisp languages where a complexity classification 
of crisp languages including crisp fixed-value cost functions would imply a complexity classification of 
all crisp languages [ ], where crisp fixed-value cost functions are defined as Ud{x) = if x = d and 
Ud{x) = ooif X ^ d. 

4 Proof of Theorem 9 

In Section 4. 1 we will first prove part (a). Then in Section 4.2 we will prove some properties of Gr assuming 
that Gr does not have self-loops. Using these properties, we will construct an STP on M in Section 4.3. 

''Roughly speaking, the graph structure in [27] was defined via a "min" polymorphism rather than a (min, max) multimor- 
phism, so the property {p, q} £ E ^ {p, q} £ E (that we prove for our graph in the next section) might not hold in Takhanov's 
case. Also, in [ 27] edges were not classified as being soft or hard. 
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4.1 NP-hard case 



In this section we prove Theorem 9(a). From the assumption, there is a binary / € T* such that /(a, a) + 
f{b,b) > f{a,b) + f{b,a), and at least one of the assignments {a,a), {b,b) is in dom/. First, let us 
assume that both {a, a) and (6,6) are in dom/. Clearly, g € F*, where g{x,y) = f{x,y) + f{y,x) 
has the following property: g{a,a),g{b,b) > g{a,b) = g{b,a). Let a = g{a,a) and p = g{b,b). If 
a / /3, let a < /3 (the other case is analogous). Using unary cost functions with cost (/5 — a)/2, we 
can construct /i G F* satisfying h{a,a) = h{b,b) > h{a,b) = h{b,a). Now if h{a,a) = h{b,b) = 1 
and h{a, b) = h{b, a) = 0, this would correspond to the Max-SAT problem with XOR clauses, which is 
NP-hard ']. Since adding a constant to all cost functions and scaling all costs by a constant factor do not 
affect the difficulty of solving a VCSP instance, and F is conservative, we can conclude that F is intractable. 

Now suppose that exactly one of the assignments (a, a), (6, b) is in dom/; without loss of generaUty, 
let us assume that (a, a) G dom/ and (6, b) ^ dom/. Thus g ^T, where g{a, a) = g{a, b) = g{b, a) = 
and g{b, b) = oo. Using unary cost functions, we can encode the maximum independent set problem in 
graphs, a well-known NP-hard problem [35]: every vertex is represented by a node with domain {a, b} (a 
represents not in the set, b represents in the set); an edge between two vertices imposes a term with the cost 
function g between the corresponding two nodes. For every node, there is a unary term with cost function 
h, where h{a) = 1 and h{b) = 0. It is clear that minimising the number of nodes assigned a is the same as 
maximising the number of nodes assigned b, thus finding a maximum independent set in the graph. □ 

4.2 Properties of graph Gr 

From now on we assume that E does not have soft self-loops. Our goal is to show that F admits an STP on 



In the lemma below, a path of length A; is a sequence of edges {po,Pi}, {^1,^2}) • • • , {Pk~i,Pk}, where 
{pi~i,Pi} € E. Note that we allow edge repetitions. A path is even iff its length is even. A path is a cycle 
if po =pk.lfX CP then {X, E[X]) denotes the subgraph of (P, E) induced by X. 

Lemma 11. Graph Gr = {P, E) satisfies the following properties: 

(a) {p, q} ^ E implies {p, q} £ E and vice versa. The two edges are either both soft or both hard. 

(b) Suppose that {p, q} £ E and {q, r} € E, then {p, f} € E. If at least one of the first two edges is soft 
then the third edge is also soft. 

(c) For each p £ P, nodes p and p are either both in M or both in M. 

(d) There are no edges from M to M. 

(e) Graph (M, E[M]) does not have odd cycles. 

(f) Graph (M, E[M]) does not have even paths from (a, b) to (6, a) where (a, b) G P. 

(g) Nodes p £ M do not have incident soft edges. 

Proof, (a) Follows from the definition. 

(b) Let p = {ai,bi), q = (02, 62) and r = (03, 63). From the definition of the graph, let f,g £ T* be 
binary cost functions such that (*) /(ai, 02) + /(6i, 62) > /(ai, 62) + /(6i, 02) and 5(02, 03) + 9(62, 63) > 
g{0'2, 63) + 5(62, ^s)- Without loss of generality, we can assume that 



M. 



f{ai,a2) = a, f{aiM) 



/(6i,a2) = 7, f{bi,b2) = a' 
/(62,a3)=7> 9(62,63)=^' 



(3) 
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This can be achieved by replacing / with /'(x, y) = f{x, y) + /(y, x) and adding a constant, and similarly 
for g; condition (*) and the complexity of F are unaffected. From (*) we get a + a' > 27; thus, by adding 
unary terms to / we can ensure that a > 7, a' > 7 and /(x, y) > ^ for all x,y £ D. Similarly, we can 
assume that /3 > 7, /?' > 7 and g{x, y) > for all x,y G D. 

Let h{x,z) = min^(z D{f{x,y) + u^a^Miv) + 9{y,z)}, where u^a^^iy) = if y G {02,62}, and 
u^a2,b2}iy) — C Otherwise. Here C is a sufficiently large constant, namely C > max{a, a'}+max{/3, /3'}. 
From the definition of h and (3) we get /i(ai, 03) = h{bi, 63) = 27. Moreover, h{ai, 63) > min{Q;, /3' , C} 
and 03) > min{a', /3, C}, where /i(ai, 63) > C iff a = /5' = 00, and 03) > C iff a' = /3 = 00. 
Therefore, /i(ai, 63) + h{bi,a^) > /i(ai, 03) + 63), and so {p, f} G E^. 

Now suppose that at least of one of the edges {p, g}, {g, r} is soft, then we can assume that either 
(a, a') / (00,00) or (/3,/3') / (00,00). In each case either h{ai,h'i) < C or h{bi,a^) < C, and thus 
{p, f} is soft. 

(c) Follows from (a). 

(d) Suppose {p, q} £ E and q £ M. The latter fact implies {q, q} G ii^, so by (b) we have {p, q} G E. 
From (a) we also get {q, p} G E. Applying (b) again gives {p, p} G E. Thus p e M. 

(e) We prove by induction on k that (M, -E'[M]) does not have cycles of length 2k + 1. For A; = the 
claim is by assumption (nodes of M do not have self-loops). Suppose it holds for A; > 0, and suppose that 
(M, E[M]) has a cycle V, {p, q}, {q, r}, {r, s} of length 2fc + 3 where V is path from s G M to p G M 
of length 2k. Properties (b) and (a) give respectively {p, f} G and {f, s} G E. Applying (b) again gives 
{p, s'} G E, therefore (M, iiJ[Af]) has a cycle V, {p, s} of length 2k + 1. This contradicts to the induction 
hypothesis. 

(f) Suppose (M, -^[M]) has an even path V, {p, q}, {q, f} where V is an even path from r to p. Property 
(b) gives {p, r} G E, so (M, -E'[M]) has an odd cycle V, {p, r}. This contradicts to (e). 

(g) Suppose p £ M (implying E has a hard self-loop {p,p}) and {p, q} is a soft edge in E. Properties (b) 
and (a) give respectively {p,q} G E and {q,p} G E, and furthermore both edges are soft. Applying (b) 
again gives that {p, p} £ E and this edge is soft. This contradicts to the assumption that {P, E) does not 
have soft self-loops. □ 

4.3 Constructing (n, U) 

In this section we complete the proof of Theorem 9 by constructing a pair of operations (n, U) for F that 
behaves as an STP on M and as a multi-projection (returning its two ai^guments in the same order) on M. 

Lemma 12. There exists an assignment a : M — ?> {— 1,+1} such that (i) cr{p) = —cr{q) for all edges 
{p, q} G E, and (ii) cr{p) = —(t{p) for all p G M. 

Proof. Let (Mi, £'1), . . . , {Mk,Ek) be the connected components of graph (M, E[M]). Consider the fol- 
lowing algorithm for constructing a: 

50 Set a{p) = ("unassigned") for all p e M. 

51 for i = 1, . . . , /c 

52 Choose pi G Mi. If a (pi) is assigned set cr(pj) = —a (pi), otherwise set a{pi) to — lor +1 arbitrarily. 

53 For p G Mi — {pi} set a{p) = {—l)''a{pi) where k is the length of a path from pi to p. (All paths 
from Pi to p must have the same parity since, by Lemma 1 1(d), there is no odd cycle in (Mj, Ei).) 

54 end for 
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It can be seen that the algorithm produces an assignment a with claimed properties. Showing prop- 
erty (i) is straightforward, let us prove that (ii) holds. If p and p belong to the same connected component, 
then, by Lemma 1 1(f), a path from ptop must be odd, so ct{p) = —a{p). Suppose that p € Mj and p € Mi 
where j < i, i.e. a{p) was assigned first. Let pi be the element of Mi chosen in step S2. We know that 
there is a path from pi to p of some length k. Lemma 1 1 (a) implies that there is a path from p to pi of the 
same length k. Therefore, pi € Mj, so a{pi) would have already been assigned in step S2 for component 
Mi and thus we would set a{pi) = —a{pi). The claim now follows from 

a{p) = (-l)^a(K) = -(-l)Vfe) = -(-!)'= • {-lfa{p) = -a{p) 

□ 

Given assignment a constructed in Lemma 12, we now define operations U : D as follows: 

• ar\a = aL\a = a for a^D. 

• If (a, 6) € M then a □ 6 and a U 6 are the unique elements of D satisfying {a n b, a \J b} = {a, b} 
and a{a nb,aUb) = +1. 

• If (a, 6) G M then aHb = a and aUb = b. 

Lemma 13. For any binary cost function / € F* and any x,y & dom/ there holds 

f{xny) + f{xUy)<f{x) + f{y) (4) 



Proof. Denote (o, a') = x n y and (6, 6') = x U y. We can assume without loss of generality that 
{x, y} 7^ {(a, a'), {b, b')}, otherwise the claim is straightforward. It is easy to check that the assumption 
has two implications: (i) a ^ b and a' ^ b'; (ii) {x, y} = {(a, b'), {b, a')}. 

If /(a, a') + f{b, b') = f{a, b') + f{b, a'), then (4) holds trivially. If f{a, a') + f{b, b') / /(a, b') + 
f{b, a'), then E contains at least one of the edges {(a, b), (a', 6')}, {(a, b), {b' , a')}. By Lemma 11(c) and 
Lemma 11(d), pairs (a, b) and (o', b') must either be both in M or both in M. In the former case (4) is a 
trivial equality from the definition of □ and U, so we assume the latter case. 

The definition of U and the fact that (a, a') = x n y and {b, b') = x U y imply that a{a, b) = 
(T{a', b') = +1. Thus, set E does not have edge ((a, 6), (a', b')), and therefore 

f{a,a') + f{b,b')<f{a,b') + f{b,a') 

which is equivalent to (4). □ 

In order to proceed, we introduce the following notation. Given a cost function / of arity m, we 
denote by V the set of variables corresponding to the ai^guments of /, with \ V\ = m. For two assignments 
x,y ^ we denote A(ic, y) = {i ^ V \ Xi yi} to be the set of variables on which x and y differ. 

Lemma 14. Condition (4) holds for any cost function f ^ T* and assignments x,y € dom/ with 
\A{x,y)\ < 2. 

Proof. If |A(a;, y)\ < 1 then {x r\ y,x L\ y} = {x, y}, so the claim is trivial. We now prove it in the 
case |A(a;, y)\ = 2 using induction on \V\. The base case \V\ = 2 follows from Lemma 13; suppose that 
\V\ > 3. Choose k € V — A{x,y). For simplicity of notation, let us assume that k corresponds to the first 
argument of /. Define cost function of |y| — 1 variables as 

g{z) = mm{u{a) + /(a, z)} Mz G D^^i'^i (5) 
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where u is the following unary cost function: u{a) = if a = = yk, and u{a) = C otherwise. Here C 
is a sufficiently large constant, namely C > f{x) + f{y). 

Let X and y be the restrictions of respectively x and ytoV — {k}. Clearly, g € T*, g{x) = f{x) < oo 
and g{y) = f{y) < oo. By the induction hypothesis 

gix ny)+ g{x Uy)< g{x) + g{y) = f{x) + f{y) (6) 

This implies that g{x Uy) < C, which is possible only if g{x Uy) = f{a, x n y) = f{x □ y) where 
a = Xk = Uk- Similarly, g{x \Jy) = f{a, xUy) = f{x U y). Thus, (6) is equivalent to (4). □ 

Lemma 15. Condition (4) holds for any cost function / € F* and any x,y dom/. 

Proof. We use induction on | A(a;, y)\. The base case | A(a;, y)\ < 2 follows from Lemma 14; suppose that 
|A(a;, y)\ > 3. Let us partition A{x, y) into thi^ee sets A, B, C as follows: 

A = {i £ A{x,y)\{xi,yi) e M, Xi = Xinyi, yi = XiUyi} 
B = {i e A{x,y) \ {xi,yi) e M, Xi = XiUyi, ?/i = Xj n yj} 
C = {i £ A{x,y)\{xi,yi) eM} 

Two cases are possible. 

Case 1 |A U C| > 2. Let us choose variable k £ A U C, and define assignments x',y' as follows: 
X ■ = = Xj = yi if Xi = yi, and for other variables 

Xi if i = k ( Xi if i = k 

ifie{AuC)-{k} y-=lyi if i e (Au C) - {k} 

Xi if i & B [ j/j if i € -B 

It can be checked that 

X r\y' = xHy xUy' = x' x' Uy = y' x' Uy = xUy 

Furthermore, A[x, y') = A(a;, y) — {k} and A{x' , y) = A{x, y) — {{A U C) — {A;}) so by the induction 
hypothesis 

f{x n y) + f{x') < fix) + f{y') (7) 

assuming that y' € dom/, and 

f{y') + f{x Uy)< fix') + f{y) (8) 
assuming that x' G dom/. Two cases are possible: 

• y' € dom/. Inequality (7) implies that x' G dom/. The claim then follows from summing (7) 
and (8). 

• y' ^ dom/. Inequality (8) implies that x' ^ dom/. Assume for simplicity of notation that k 
corresponds to the first argument of /. Define cost function of |y| — 1 variables 

g{z) = min{M(a) + /(a, z)} \/z € 

where u{a) is the following unary cost function: u{xk) = 0, u{yk) = C and u{a) = 2C for 
a ^ D — {xk,yk}- Here C is a sufficiently large constant, namely C > f{x) + f{y). 
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Let X, y, x', y' be restrictions of respectively x, y, x', y'toV — {k}. Cleai^ly, g GT* and 

9{y) = g{y') = u{yk) + f{yk,y) = f{y) + C {since {xk,y) = y' ^ domf) 

9{x) = fixk,x) = f{x) 

By the induction hypothesis 

g{x ny)+ g{x Uy) < g{x) + g{y) = f{x) + f{y) + C (9) 

We have g{x \Jy) < 2C, so we must have either g{x L\y) = f{xk,x U y) or g{x Uy) = f{yi:,x U 
y) + C = f{x Uy) + C. The former case is impossible since {x^^x U y) = x' ^ dom/, so 
g{x Uy) = f{x Uy) + C. Combining it with (9) gives 

g{xny) + f{xUy)<f{x) + f{y) (10) 

This implies that g{x Uy) < C, so we must have g{x n y) = f{xk, xHy) = f{x □ y). Thus, (10) 
is equivalent to (4). 

Case 2 \B\ > 2. Let us choose variable k G B, and define assignments x', y' as follows: = = Xj = 
Hi if Xi = Hi, and for other variables 



Ui ifi = k 
yt ifi e AUC 
ifi e B - {k} 



Hi ifi = k 

Xi ifieAuC y'i 
Xi ifieB- {k} 

It can be checked that 

x' r\y = xHy x' Uy = y' xUy' = x' xUy' = x Uy 

Furthermore, A{x',y) = A{x,y) — {k} and A{x,y') = A{x,y) — {B — {k}) so by the induction 
hypothesis 

fixny) + f{y')<f{x') + f{y) (11) 

assuming that x' £ dom /, and 

f{x')+f{xUy)<f{x) + f{y') (12) 
assuming that y' G dom/. Two cases are possible: 

• a;' € dom/. Inequality (11) implies that y' € dom/. The claim then follows from summing (11) 
and (12). 

• x' ^ dom/. Inequality (12) implies that y' ^ dom/. Assume for simplicity of notation that k 
corresponds to the first argument of /. Define function of |y | — 1 variables 

g{z) = min{n(a) + /(a, z)} € D^^^^^ 

where u{a) is the following unary term: u{yk) = 0, u{xk) = C and u{a) = 2C for a € D—{xk,yk}- 
Here C is a sufficiently large constant, namely C > f{x) + f{y). 

Let x,y,x' ,y' he restrictions of respectively x,y,x' ,y' to V — {k}. Clearly, 5 € T* and 

g{x) = g{x') = u{xk) + f{xk,x) = f{x) + C {since {yk,x) = x' dom f) 

9{y) = f{yk,y) = f{y) 
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By the induction hypothesis 



g{x ny)+ g{x Uy) < g{x) + g{y) = f{x) + f{y) + C (13) 

We have g{x Uy) < 2C, so we must have either g{x Uy) = f{yk,x U y) or g{x Uy) = f{xk,x U 
y) + C = f{x Uy) + C. The former case is impossible since {yk,x U y) = y' ^ dom/, so 
g{x Uy) = f{x Uy) + C. Combining it with (13) gives 

g{xny) + f{xUy)<f{x) + f{y) (14) 

This implies that g{x r\y) < C, so we must have g{x r\y) = f{yk, xHy) = f{x PI y). Thus, (14) 
is equivalent to (4). 

□ 



5 Discussion of the general case 

While the case of conservative finite-valued languages has now been completely characterized, the case 
of general conservative languages (i.e. languages that allow both finite and infinite costs and contain all 
finite-valued unary functions) remains open. We conjecture that the tractability criterion is the same as for 
finite-valued languages: F is tractable iff Gr does not contain a soft self-loop. The "only if" part is proven 
in Theorem 9(a). It remains to show that if Gr does not contain a soft self-loop, then T is tractable. We 
know that in that case F admits a multimoiphism, constructed in Section 4.3, that behaves as an STP on M. 

Below we review some known partial results related to the general case. These results use notions of 
polymorphisms and multimorphisms that generalise Definition 3. 

Definition 16. A mapping F : D, k > I is called a polymorphism of a cost function f : — > M_|_ 

if 

F{xi, ...,Xk) G dom/ Va^i, ... ,Xk e dom/ 

where F is applied component-wise. F is a polymorphism of a language F if F is a polymorphism of every 
cost function in F. 

Definition 17 ([22]). A mapping {Fi,...,Fk) : D'' Z)^ where Fi : ^ D, 1 < i < k, is a 
multimoiphism of a cost function f : — )• if 

k k 

^ f{Fi{xi, Xk)) < ^ f{xi) Vaji, . . . , ajfc e dom/ 

i=l i=l 

where Fi are applied coordinate-wise. {Fi, . . . ,Fk) is a multimorphism of a language F if {Fi, . . . ,Fk) is 
a multimorphism of every cost function in F. 

An STP, defined in Definition 3, is an example of a binary multimorphism. Note, if {Fi, . . . , F^) is a 
multimorphism of F, then each mapping Fi, 1 < i < k is a. polymorphism of F. 
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5.1 Boolean case (\D\ =2) 

This case has been completely characterized by Cohen et al. [22]. 
Theorem 18 ([ ]). Let T be a Boolean conservative language. 

1. Suppose r is finite-valued. If T admits an STP multimorphism then it is tractable, otherwise it is 



2. Suppose r is general-valued. IfT admits of the following as a multimorphism then it is tractable: 

• STP multimorphism 

• {Mj,Mj,Mn) multimorphism where Mj and Mn are the unique majority and minority opera- 
tions defined by Mj{a, a, b) = Mj{a,b,a) = Mj{b,a,a) = a, Mn{a,a,b) = Mn{a,b,a) = 
Mn{b, a, a) = b. 

Otherwise T is NP-hard. 
5.2 Non-Boolean case (ID I > 2) 

Dichotomy results have been established for some special cases by Bulatov [ii] and Takhanov [27]. 

Theorem 19 ([ i j.]). Let T be a crisp language containing all possible {0, oo}-valued unary cost functions. 
Suppose that for every subset B (1 D with \B\ = 2 there exists polymorphism ofT whose restriction 
F^\b to B satisfies one of the following conditions: 

• F^\b is a binary semilattice operation. 

• F^\b is the ternary majority operation. 

• F^\b is the ternary minority operation. 
Then T is tractable; otherwise T is NP-hard. 

Theorem 20 ([ ]). Let T be a language containing some crisp functions and additionally all possible 
finite-valued unary functions, and let M be a subset of P with the property (a, b) and (6, a) are either both 
in M or both not in M. A ternary operation m : D is called arithmetical on M = P — M if 

m{a, b, c) G {a, b, c} for all a,b,c £ D and m{a, a, b) = m{b, a, a) = m{b, a, b) = bfor all (a, 6) G M 

IfT admits an STP multimorphism on M and an arithmetical polymorphism on M then T is tractable. 
Otherwise T is NP-hard. 

One can show that the absence of soft self-loops in Gr implies Takhanov's "necessary local conditions" 
for the set of polymorphisms of F (details are omitted). Takhanov's results then imply the existence of an 
arithmetical polymorphism on M. 

Finally, we mention another special case of tractable conservative languages. 

Theorem 21 ([22]). Let Mji, Mj2, and Mn^ be the ternary functions defined on D as follows: 



NP-hard. 




= < 



V 



X 



y 



otherwise 



X if y 



z and z X 



Mns{x,y,z) 



= < 



y if X 



z and z ^ y 



z otherwise 



IfT admits multimorphism {Mji,Mj2, Mj^) then F is tractable. 
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